NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

BaboonLand Dataset: Tracking Primates in the Wild and Automating Behaviour Recognition from Drone Videos

https://doi.org/10.1007/s11263-025-02493-5

Duporge, Isla; Kholiavchenko, Maksim; Harel, Roi; Wolf, Scott; Rubenstein, Daniel I; Crofoot, Margaret C; Berger-Wolf, Tanya; Lee, Stephen J; Barreau, Julie; Kline, Jenna; et al (June 2025, International Journal of Computer Vision)

Using unmanned aerial vehicles (UAVs) to track multiple individuals simultaneously in their natural environment is a powerful approach for better understanding the collective behavior of primates. Previous studies have demonstrated the feasibility of automating primate behavior classification from video data, but these studies have been carried out in captivity or from ground-based cameras. However, to understand group behavior and the self-organization of a collective, the whole troop needs to be seen at a scale where behavior can be seen in relation to the natural environment in which ecological decisions are made. To tackle this challenge, this study presents a novel dataset for baboon detection, tracking, and behavior recognition from drone videos where troops are observed on-the-move in their natural environment as they move to and from their sleeping sites. Videos were captured from drones at Mpala Research Centre, a research station located in Laikipia County, in central Kenya. The baboon detection dataset was created by manually annotating all baboons in drone videos with bounding boxes. A tiling method was subsequently applied to create a pyramid of images at various scales from the original 5.3K resolution images, resulting in approximately 30K images used for baboon detection. The baboon tracking dataset is derived from the baboon detection dataset, where bounding boxes are consistently assigned the same ID throughout the video. This process resulted in half an hour of dense tracking data. The baboon behavior recognition dataset was generated by converting tracks into mini-scenes, a video subregion centered on each animal. These mini-scenes were annotated with 12 distinct behavior types and one additional category for occlusion, resulting in over 20 hours of data. Benchmark results show mean average precision (mAP) of 92.62% for the YOLOv8-X detection model, multiple object tracking precision (MOTP) of 87.22% for the DeepSORT tracking algorithm, and micro top-1 accuracy of 64.89% for the X3D behavior recognition model. Using deep learning to rapidly and accurately classify wildlife behavior from drone footage facilitates non-invasive data collection on behavior enabling the behavior of a whole group to be systematically and accurately recorded. The dataset can be accessed at https://baboonland.xyz.
more » « less
Free, publicly-accessible full text available June 16, 2026
WildWing: An open‐source, autonomous and affordable UAS for animal behaviour video monitoring

https://doi.org/10.1111/2041-210X.70018

Kline, Jenna; Zhong, Alison; Irizarry, Kevyn; Stewart, Charles V; Stewart, Christopher; Rubenstein, Daniel I; Berger‐Wolf, Tanya (March 2025, Methods in Ecology and Evolution)

Abstract Drones have become invaluable tools for studying animal behaviour in the wild, enabling researchers to collect aerial video data of group‐living animals. However, manually piloting drones to track animal groups consistently is challenging due to complex factors such as terrain, vegetation, group spread and movement patterns. The variability in manual piloting can result in unusable data for downstream behavioural analysis, making it difficult to collect standardized datasets for studying collective animal behaviour.To address these challenges, we present WildWing, a complete hardware and software open‐source unmanned aerial system (UAS) for autonomously collecting behavioural video data of group‐living animals. The system's main goal is to automate and standardize the collection of high‐quality aerial footage suitable for computer vision‐based behaviour analysis. We provide a novel navigation policy to autonomously track animal groups while maintaining optimal camera angles and distances for behavioural analysis, reducing the inconsistencies inherent in manual piloting.The complete WildWing system costs only $650 and incorporates drone hardware with custom software that integrates ecological knowledge into autonomous navigation decisions. The system produces 4 K resolution video at 30 fps while automatically maintaining appropriate distances and angles for behaviour analysis. We validate the system through field deployments tracking groups of Grevy's zebras, giraffes and Przewalski's horses at The Wilds conservation centre, demonstrating its ability to collect usable behavioural data consistently.By automating the data collection process, WildWing helps ensure consistent, high‐quality video data suitable for computer vision analysis of animal behaviour. This standardization is crucial for developing robust automated behaviour recognition systems to help researchers study and monitor wildlife populations at scale. The open‐source nature of WildWing makes autonomous behavioural data collection more accessible to researchers, enabling wider application of drone‐based behavioural monitoring in conservation and ecological research.
more » « less
Free, publicly-accessible full text available March 10, 2026
Characterizing and Modeling AI-Driven Animal Ecology Studies at the Edge

https://doi.org/10.1109/SEC62691.2024.00025

Kline, Jenna; O'Quinn, Austin; Berger-Wolf, Tanya; Stewart, Christopher (December 2024, IEEE)

Full Text Available
Prompt-CAM: Making Vision Transformers Interpretable for Fine-Grained Analysis

Chowdhury, Arpita; Paul, Dipanjyoti; Mai, Zheda; Gu, Jianyang; Zhang, Ziheng; Mehrab, Kazi Sajeed; Campolongo, Elizabeth G; Rubenstein, Daniel; Stewart, Charles V; Karpatne, Anuj; et al (June 2025, Proceedings of the Computer Vision and Pattern Recognition Conference)

We present a simple approach to make pre-trained Vision Transformers (ViTs) interpretable for fine-grained analysis, aiming to identify and localize the traits that distinguish visually similar categories, such as bird species. Pre-trained ViTs, such as DINO, have demonstrated remarkable capabilities in extracting localized, discriminative features. However, saliency maps like Grad-CAM often fail to identify these traits, producing blurred, coarse heatmaps that highlight entire objects instead. We propose a novel approach, Prompt Class Attention Map (Prompt-CAM), to address this limitation. Prompt-CAM learns class-specific prompts for a pre-trained ViT and uses the corresponding outputs for classification. To correctly classify an image, the true-class prompt must attend to unique image patches not present in other classes' images (i.e., traits). As a result, the true class's multi-head attention maps reveal traits and their locations. Implementation-wise, Prompt-CAM is almost a "free lunch," requiring only a modification to the prediction head of Visual Prompt Tuning (VPT). This makes Prompt-CAM easy to train and apply, in stark contrast to other interpretable methods that require designing specific models and training processes. Extensive empirical studies on a dozen datasets from various domains (e.g., birds, fishes, insects, fungi, flowers, food, and cars) validate the superior interpretation capability of Prompt-CAM. The source code and demo are available at https://github.com/Imageomics/Prompt_CAM.
more » « less
Free, publicly-accessible full text available June 1, 2026
Fish-Vista: A Multi-Purpose Dataset for Understanding & Identification of Traits from Images

Mehrab, Kazi Sajeed; Maruf, M; Daw, Arka; Neog, Abhilash; Manogaran, Harish Babu; Khurana, Mridul; Feng, Zhenyang; Altintas, Bahadir; Bakis, Yasin; Campolongo, Elizabeth; et al (June 2025, CVPR)

The availability of large datasets of organism images combined with advances in artificial intelligence (AI) has significantly enhanced the study of organisms through images, unveiling biodiversity patterns and macro-evolutionary trends. However, existing machine learning (ML)-ready organism datasets have several limitations. First, these datasets often focus on species classification only, overlooking tasks involving visual traits of organisms. Second, they lack detailed visual trait annotations, like pixel-level segmentation, that are crucial for in-depth biological studies. Third, these datasets predominantly feature organisms in their natural habitats, posing challenges for aquatic species like fish, where underwater images often suffer from poor visual clarity, obscuring critical biological traits. This gap hampers the study of aquatic biodiversity patterns which is necessary for the assessment of climate change impacts, and evolutionary research on aquatic species morphology. To address this, we introduce the Fish-Visual Trait Analysis (Fish-Vista) dataset—a large, annotated collection of about 80K fish images spanning 3000 different species, supporting several challenging and biologically relevant tasks including species classification, trait identification, and trait segmentation. These images have been curated through a sophisticated data processing pipeline applied to a cumulative set of images obtained from various museum collections. Fish-Vista ensures that visual traits of images are clearly visible, and provides fine-grained labels of various visual traits present in each image. It also offers pixel-level annotations of 9 different traits for about 7000 fish images, facilitating additional trait segmentation and localization tasks. The ultimate goal of Fish-Vista is to provide a clean, carefully curated, high-resolution dataset that can serve as a foundation for accelerating biological discoveries using advances in AI. Finally, we provide a comprehensive analysis of state-of-the-art deep learning techniques on Fish-Vista.
more » « less
Free, publicly-accessible full text available June 15, 2026
ML Field Planner: Analyzing and Optimizing ML Pipelines For Field Research

https://doi.org/10.1145/3708035.3736013

Stubbs, Joe; Balasubramaniam, Sowbaranika; Khuvis, Samuel; Withana, Sachith; Vallabhajosyula, Manikya Swathi; Cardone, Richard; Garcia, Christian; Freeman, Nathan; Guzman, Carlos; Plale, Beth; et al (July 2025, ACM)

Free, publicly-accessible full text available July 18, 2026
Harnessing artificial intelligence to fill global shortfalls in biodiversity knowledge

https://doi.org/10.1038/s44358-025-00022-3

Pollock, Laura J; Kitzes, Justin; Beery, Sara; Gaynor, Kaitlyn M; Jarzyna, Marta A; Mac_Aodha, Oisin; Meyer, Bernd; Rolnick, David; Taylor, Graham W; Tuia, Devis; et al (March 2025, Nature Reviews Biodiversity)

Large, well described gaps exist in both what we know and what we need to know to address the biodiversity crisis. Artificial intelligence (AI) offers new potential for filling these knowledge gaps, but where the biggest and most influential gains could be made remains unclear. To date, biodiversity-related uses of AI have largely focused on tracking and monitoring of wildlife populations. Rapid progress is being made in the use of AI to build phylogenetic trees and species distribution models. However, AI also has considerable unrealized potential in the re-evaluation of important ecological questions, especially those that require the integration of disparate and inherently complex data types, such as images, video, text, audio and DNA. This Review describes the current and potential future use of AI to address seven clearly defined shortfalls in biodiversity knowledge. Recommended steps for AI-based improvements include the re-use of existing image data and the development of novel paradigms, including the collaborative generation of new testable hypotheses. The resulting expansion of biodiversity knowledge could lead to science spanning from genes to ecosystems — advances that might represent our best hope for meeting the rapidly approaching 2030 targets of the Global Biodiversity Framework.
more » « less
Free, publicly-accessible full text available March 1, 2026
When to Use Demographic Data in Healthcare Models: A Bias-Responsible Approach

https://doi.org/10.1109/CAI59869.2024.00276

Zeleke, Sebrina; Berger-Wolf, Tanya; Ning, Xia (June 2024, IEEE)

Full Text Available
Fine-Tuning is Fine, if Calibrated

Mai, Zheda; Chowdhury, Arpita; Zhang, Ping; Tu, Cheng-Hao; Chen, Hong-You; Pahuja, Vardaan; Berger-Wolf, Tanya; Gao, Song; Stewart, Charles; Su, Yu; et al (December 2024, NeurIPS)

Full Text Available
Reviving the Context: Camera Trap Species Classification as Link Prediction on Multimodal Knowledge Graphs

Pahuja, Vardaan; Luo, Weidi; Gu, Yu; Tu, Cheng-Hao; Chen, Hong-You; Berger-Wolf, Tanya; Stewart, Charles; Gao, Song; Chao, Wei-Lun; Su, Yu (October 2024, ACM International Conference on Information and Knowledge Management (CIKM))

Full Text Available

« Prev Next »

Search for: All records